Dataset-invariant covariance normalization for out-domain PLDA speaker verification
Abstract
In this paper we introduce a novel domain-invariant covariance normalization (DICN) technique to relocate both in-domain and out-domain i-vectors into a third dataset-invariant space, providing an improvement for out-domain PLDA speaker verification with a very small number of unlabelled in-domain adaptation i-vectors. By capturing the dataset variance from a global mean using both development out-domain i-vectors and limited unlabelled in-domain i-vectors, we can obtain domain-invariant representations of PLDA training data. The DICN-compensated out-domain PLDA system is shown to perform as well as in-domain PLDA training with as few as 500 unlabelled in-domain i-vectors for NIST-2010 SRE and 2000 unlabelled in-domain i-vectors for NIST-2008 SRE, and to give considerable relative improvement over both out-domain and in-domain PLDA development if more are available.
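The abstract's central step, capturing dataset variance around a global mean of pooled out-domain and in-domain i-vectors, can be illustrated with a minimal NumPy sketch. This is one plausible reading of the abstract and not necessarily the authors' exact formulation: the function name `dicn_transform`, the small ridge term, and the choice of a Cholesky factor `A` satisfying `A A^T = cov^{-1}` are all assumptions made for illustration.

```python
import numpy as np


def dicn_transform(out_domain, in_domain, ridge=1e-6):
    """Estimate a covariance-normalizing projection from pooled i-vectors.

    out_domain, in_domain: arrays of shape (n_vectors, dim).
    Returns the global mean and a matrix A with A @ A.T ~= inv(cov).
    The ridge term is an assumption added for numerical stability.
    """
    # Pool both datasets; no speaker or domain labels are needed.
    pooled = np.vstack([out_domain, in_domain])

    # Global mean over the pooled set.
    global_mean = pooled.mean(axis=0)

    # Covariance of all i-vectors around the global mean.
    centered = pooled - global_mean
    cov = centered.T @ centered / pooled.shape[0]
    cov += ridge * np.eye(cov.shape[0])

    # Lower-triangular A such that A @ A.T equals the inverse covariance.
    A = np.linalg.cholesky(np.linalg.inv(cov))
    return global_mean, A
```

Under these assumptions, i-vectors stored as rows would be projected with `ivectors @ A` before PLDA training and scoring. Whether the global mean is also subtracted at that stage is left to the caller, since the abstract does not say.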
Similar resources
Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification
The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non-target (or out-domain) data significantly affects the speaker verification performance due to the domain mismatch between training and evaluation data. To improve the speaker verification performance, a sufficient amount of domain mismatch compensated out-domain data must be used to train the PLDA ...
Domain adaptation based Speaker Recognition on Short Utterances
This paper explores how in-domain and out-domain probabilistic linear discriminant analysis (PLDA) speaker verification behaves when enrolment and verification lengths are reduced. Experimental studies have found that when full-length utterances are used for evaluation, the in-domain PLDA approach shows more than 28% improvement in EER and DCF values over the out-domain PLDA approach and when short utterances a...
PLDA based speaker recognition on short utterances
This paper investigates the effects of limited speech data in the context of speaker verification using a probabilistic linear discriminant analysis (PLDA) approach. Being able to reduce the length of required speech data is important to the development of automatic speaker verification systems in real-world applications. When sufficient speech is available, previous research has shown that heav...
Compensating Inter-Dataset Variability in PLDA Hyper-Parameters for Robust Speaker Recognition
Recently we have introduced a method named inter-dataset variability compensation (IDVC) in the context of speaker recognition in a mismatched dataset. IDVC compensates dataset shifts in the i-vector space by constraining the shifts to a low-dimensional subspace. The subspace is estimated from a heterogeneous development set which is partitioned into homogeneous subsets. In this work we generali...
Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis
I-vector extraction and Probabilistic Linear Discriminant Analysis (PLDA) have become the state-of-the-art configuration for speaker verification. Recently, Gaussian-PLDA has been improved by a preliminary length normalization of i-vectors. This normalization, known to increase the Gaussianity of the i-vector distribution, also improves performance of systems based on standard Linear Discriminan...